Abstract
The motion estimation and disparity estimation are used to remove the temporal and inter-view redundancies in multiview plus depth video coding, however, the variable block-size ME and DE make the computational complexity increase dramatically. This drawback limits it to be applied in real-time applications. In this paper, based on the mode correlations between depth video and its corresponding texture video, motion prediction and coded block pattern, we propose a fast mode decision algorithm to reduce the computational complexity of multiview depth video coding. Experimental results show that the proposed algorithm can achieve 67.18 and 69.90 % encoding time saving for even and odd views, respectively, while maintaining a comparable rate-distortion performance. In addition, with the dramatic encoding time reduction, the proposed algorithm becomes more suitable for real-time applications.
Similar content being viewed by others
References
Kauff P., Atzpadin N., Fehn C., Müller M., Schreer O., Smolic A., Tanger R.: Depth map creation and image-based rendering for advanced 3DTV services provding interoperability and scalability. Signal Process. Image Commun. 22(2), 217–234 (2007)
Mueller, K., Merkel, P., Smolic, A., Wiegand, T.: Multiview coding using AVC, ISO/IEC JTC1/SC29/WG11, Document M12945, Bangkok, Thailand (2006)
Vetro A., Wiegand T., Sullivan G.J.: Overview of the stereo and multiview video coding extensions of the H.264/MPEG-4 AVC standard. Proc. IEEE 9(4), 626–664 (2011)
Pan, Z., Kwong, S., Xu, L., Zhang, Y., Zhao, T.: Predictive and distribution-oriented fast motion estimation for H.264/AVC. J. Real Time Image Process. doi:10.1007/s11554-012-0264-7
Nieto M., Salgado L., Cabrera J., García N.: Fast mode decision on H.264/AVC baseline profile for real-time performance. J. Real Time Image Proc. 3(1–2), 61–75 (2008)
Hu S., Zhao T., Wang H., Kwong, S.: Fast inter-mode decision based on rate-distortion cost characteristics. Proc. PCM 10 2, 145–155 (2010)
Zhao T., Wang H., Kwong S., Kuo C.-C. Jay: Fast mode decision based on mode adaptation. IEEE Trans. Circuits Syst. Video Technol. 20(5), 697–704 (2010)
Zhao T., Kwong S., Wang H., Kuo C.-C. J.: H.264/SVC mode decision based on optimal stopping theory, IEEE Trans. Image Process. 21(5), 2607–2618 (2012)
Shen L., Liu Z., Liu S., Zhang Z., An, P.: Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view coding. IEEE Trans. Broadcast. 55(4), 761–766 (2009)
Shen L., Liu Z., Yan T., Zhang Z., An, P.: Early SKIP mode decision for MVC using inter-view correlation. Signal Process. Image Commun. 25(2), 88–93 (2010)
Zhang, Y., Kwong, S., Jiang, G., Wang, X., Yu, M.: Statistical early termination model for fast mode decision and reference frame selection in multiview video coding. IEEE Trans. Broadcast. 58(1), 10–23 (2012)
Yoon, D.-H., Ho, Y.-S.: Fast mode decision algorithm for depth codingin 3D video systems using H.264/AVC. LNCS (7088) 25–35 (2012)
Micallef, B.W., Debono, C.J., Farrugia, R.A.: Fast inter-mode decision in multi-view video plus depth coding, Proc. PCS’12, 113–116 (2012)
Peng, Z., Yu, M., Jiang, G., Shao, F., Zhang, Y., Yang, Y.: Fast macroblock mode selection algorithm for multiview depth video coding. Chinese Optics Lett. 8(2), 151–154 (2010)
Zhang, Q., An, P., Zhang, Y., Shen, L., Zhang, Z.: Low complexity multiview video plus depth coding. IEEE Trans. Consum. Electr. 1857–1865 (2011)
Merkle, P., Smolic, A., Müller, K., Wiegand, T.: Efficient prediction structure for multi-view video coding, IEEE Trans. Circuits Syst. Video Technol. 17(11), 1461–1473 (2007)
Chen, Z., Xu, J., He, Y., Zheng, J.: Fast integer-pel and fractional-pel motion estimation for H.264/AVC. Journal of visual communication and image representation 17(2), 264–290 (2006)
Pan, Z., Kwong, S.: A fast Inter-Mode decision scheme based on luminance difference for H.264/AVC. Proc. ICSSE’11, 260–263 (2011)
Zeng, H., Ma, K.-K., Cai, C.: Fast mode decision for multiview video coding using mode correlation. IEEE Trans. Circuits Syst. Video Technol. 21(11), 1659–1666 (2011)
ITU-T and ISO/IEC JTC 1: Advanced video coding for generic audiovisual services. ITU-T Recommedation H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), (2010)
Chen, B.-Y., Yang, S.-H.: Using H.264 Coded block patterns for fast inter-mode selection. Proc. ICME’08, 721–724 (2008)
Vetro, A., Pandit, P., Kimata, H., Smolic, A., Wang, Y.-K.: Joint Draft 8.0 on Multiview Video Coding. Document JVT-AB204, 28th Meeting, Hannover, DE (2008)
Bjontegaard, G.: Calculation of average PSNR differences between RD-curves. Document VCEG-M33, Thirteenth Meeting, Austin, Texas, USA (2001)
JVT H.264/AVC reference software version JM14.1. http://iphome.hhi.de/suehring/tml/download/. Accessed 17 Dec 2012
Acknowledgments
This work was supported in part by the Natural Science Foundation of China under Grants 61272289, 61102088 and in part by the Guangdong Provincial Nature Science Foundation under Grant S2012010008457, Shenzhen Emerging Industries of Strategic Basic Research Project under Grant JCYJ20120617151719115.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Pan, Z., Zhang, Y. & Kwong, S. Fast mode decision based on texture–depth correlation and motion prediction for multiview depth video coding. J Real-Time Image Proc 11, 27–36 (2016). https://doi.org/10.1007/s11554-013-0328-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11554-013-0328-3