Review of the current and future technologies for video compression

Yu, Lu; Wang, Jian-peng

doi:10.1631/jzus.C0910684

Hot Topic
Published: 23 December 2009

Volume 11, pages 1–13, (2010)
Journal of Zhejiang University SCIENCE C

Lu Yu^1,2 &
Jian-peng Wang^1,2

Abstract

Many important developments in video compression technologies have occurred during the past two decades. The hybrid coding scheme that combines the block-based discrete cosine transform (DCT) with motion compensation has been adopted by most available video coding standards, notably the ITU-T H.26x and ISO/IEC MPEG-x families and the video part of the China audio video coding standard (AVS). This paper reviews the development of the four basic building blocks of the hybrid coding scheme, namely predictive coding, transform coding, quantization, and entropy coding, and gives theoretical analyses and summaries of the technological advancements. We further analyze the development trends and perspectives of video compression, highlighting problems and research directions.

Author information

Authors and Affiliations

Institute of Information and Communication Engineering, Zhejiang University, Hangzhou, 310027, China
Lu Yu & Jian-peng Wang
Key Laboratory of Integrated Information Network Technology of Zhejiang Province, Hangzhou, 310027, China
Lu Yu & Jian-peng Wang

Corresponding author

Correspondence to Lu Yu.

Additional information

Project (No. 2009CB320903) supported by the National Basic Research Program (973) of China

About this article

Cite this article

Yu, L., Wang, Jp. Review of the current and future technologies for video compression. J. Zhejiang Univ. - Sci. C 11, 1–13 (2010). https://doi.org/10.1631/jzus.C0910684

Received: 09 November 2009
Accepted: 16 November 2009
Published: 23 December 2009
Issue Date: January 2010
DOI: https://doi.org/10.1631/jzus.C0910684

Key words

CLC number

TN919.8
