Review of the current and future technologies for video compression

  • Hot Topic
Journal of Zhejiang University SCIENCE C

Abstract

Video compression technology has advanced considerably over the past two decades. The hybrid coding scheme, which combines motion-compensated prediction with a block-based discrete cosine transform, has been adopted by most available video coding standards, notably the ITU-T H.26x and ISO/IEC MPEG-x families and the video part of China's audio video coding standard (AVS). This paper reviews the development of the four basic building blocks of the hybrid coding scheme, namely predictive coding, transform coding, quantization, and entropy coding, and gives theoretical analyses and summaries of the technological advances. We further analyze the development trends and perspectives of video compression, highlighting open problems and research directions.
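As a rough illustration of how the four building blocks fit together, the following minimal Python sketch pushes one 4×4 block through prediction, transform, quantization, and an entropy estimate. It is a toy under stated assumptions and does not reproduce the method of any particular standard. The prediction is a zero-motion copy of the co-located reference block, the transform is a floating-point orthonormal DCT-II rather than the integer transforms used in practice, the quantizer is a plain uniform rounding quantizer, and entropy coding is replaced by the zeroth-order Shannon entropy of the quantized levels. All function names (`dct_1d`, `dct_2d`, `encode_block`, `entropy_bits_per_symbol`) are hypothetical.

```python
import math
from collections import Counter


def dct_1d(x):
    """Orthonormal DCT-II of a 1-D sequence (floating point, for illustration)."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        acc = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                  for i in range(n))
        out.append(scale * acc)
    return out


def dct_2d(block):
    """Separable 2-D DCT: transform every row, then every column."""
    rows = [dct_1d(r) for r in block]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]  # transpose back to [v_freq][h_freq]


def encode_block(cur, ref, qstep):
    """Predictive coding, transform coding, and quantization for one block.

    Assumption: the prediction is the co-located block of the reference
    frame (zero motion vector); a real encoder would search for motion.
    """
    residual = [[c - p for c, p in zip(cur_row, ref_row)]
                for cur_row, ref_row in zip(cur, ref)]
    coeffs = dct_2d(residual)
    # Uniform scalar quantization without a dead zone (a simplification).
    return [[int(round(v / qstep)) for v in row] for row in coeffs]


def entropy_bits_per_symbol(levels):
    """Zeroth-order entropy of the quantized levels, standing in for an
    actual entropy coder such as Huffman or arithmetic coding."""
    flat = [v for row in levels for v in row]
    total = len(flat)
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(flat).values())


if __name__ == "__main__":
    ref = [[100] * 4 for _ in range(4)]   # previous (reference) block
    cur = [[108] * 4 for _ in range(4)]   # current block, uniformly brighter
    levels = encode_block(cur, ref, qstep=8)
    print(levels)
    print(entropy_bits_per_symbol(levels))
```

In this example the residual is constant, so the transform concentrates its energy in the single DC coefficient and the remaining quantized levels are zero, which is the energy-compaction property that makes the subsequent entropy coding stage effective.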

Author information

Correspondence to Lu Yu.

Additional information

Project (No. 2009CB320903) supported by the National Basic Research Program (973) of China

About this article

Cite this article

Yu, L., Wang, Jp. Review of the current and future technologies for video compression. J. Zhejiang Univ. - Sci. C 11, 1–13 (2010). https://doi.org/10.1631/jzus.C0910684

  • DOI: https://doi.org/10.1631/jzus.C0910684
