A novel compact yet rich key frame creation method for compressed video summarization

Fei, Mengjuan; Jiang, Wei; Mao, Weijie

doi:10.1007/s11042-017-4843-2

A novel compact yet rich key frame creation method for compressed video summarization

Published: 05 June 2017

Volume 77, pages 11957–11977, (2018)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

528 Accesses
14 Citations
Explore all metrics

Abstract

Video summarization has great potential to enable rapid browsing and efficient video indexing in many applications. In this study, we propose a novel compact yet rich key frame creation method for compressed video summarization. First, we directly extract DC coefficients of I frame from a compressed video stream, and DC-based mutual information is computed to segment the long video into shots. Then, we select shots with static background and moving object according to the intensity and range of motion vector in the video stream. Detecting moving object outliers in each selected shot, the optimal object set is then selected by importance ranking and solving an optimum programming problem. Finally, we conduct an improved KNN matting approach on the optimal object outliers to automatically and seamlessly splice these outliers to the final key frame as video summarization. Previous video summarization methods typically select one or more frames from the original video as the video summarization. However, these existing key frame representation approaches for video summarization eliminate the time axis and lose the dynamic aspect of the video scene. The proposed video summarization preserves both compactness and considerably richer information than previous video summaries. Experimental results indicate that the proposed key frame representation not only includes abundant semantics but also is natural, which satisfies user preferences.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Attention-based misaligned spatiotemporal auto-encoder for video anomaly detection

Article 13 April 2024

Video shot-boundary detection: issues, challenges and solutions

Article Open access 30 March 2024

Video anomaly detection based on attention and efficient spatio-temporal feature extraction

Article 04 April 2024

References

http://butlermovies.blogspot.com/2014/02/frozen-2013-movie-free-download.html (accessed 2016. 02.22)
https://www.youtube.com/watch?v=RpXw8eV07_w (accessed 2016.02.24)
ftp://ftp.pets.rdg.ac.uk/pub/PETS2000/ (accessed 2016.02.24)
Acha AR, Pritch Y, Peleg S (2006) Making a long video short: Dynamic video synopsis Proceedings CVPR, pp 435–441
Google Scholar
Aner-Wolf A, Kender JR (2004) Video summaries and cross-referencing through mosaic-based representation. Comput Vis Image Underst 95(2):201–237
Article Google Scholar
Avila SEFD, Lopes APB, Luz AD et al (2011) VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method. Pattern Recogn Lett 32(1):56–68
Article Google Scholar
Baber J, Afzulpurkar N, Dailey MN et al (2012) Shot boundary detection from videos using entropy and local descriptor 17th International Conference on IEEE, pp 1–6
Google Scholar
Cernekova Z, Pitas I, Nikou C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circuits Syst Video Technol 16(1):82–91
Article Google Scholar
Chen Q, Li D, Tang CK (2013) KNN Matting. IEEE Trans on PAMI 35 (9):2175–2188
Article Google Scholar
Cheng MM, Zhang GX, Mitra NJ et al (2015) Global contrast based salient region detection. IEEE Trans on PAMI 37(3):569–582
Article Google Scholar
Cong Y, Yuan JS, Luo JB (2012) Towards scalable summarization of consumer videos via sparse dictionary selection. IEEE Trans Multimedia 14(1):66–75
Article Google Scholar
Dhagdi MST, Deshmukh PR (2012) Keyframe based video summarization using automatic threshold and edge matching rate. International Journal of Scientific and Research Publication 2(7):1– 12
Google Scholar
Ejaz N, Mehmood I, Baik SW (2014) Feature aggregation based visual attention model for video summarization. Comput Electr Eng 40(3):993–1005
Article Google Scholar
Furini M, Geraci F, Montangero M et al (2010) STIMO: STILl and MOving video storyboard for the web scenario. Multimedia Tools and Applications 46(1):47–69
Article Google Scholar
Gong B, Chao WL, Grauman K et al (2014) Diverse sequential subset selection for supervised video summarization. Advances in Neural Information Processing Systems :2069–2077
Guan G, Wang Z, Lu S et al (2013) Keypoint based keyframe selection. IEEE Trans Circuits Syst Video Technol 23(4):729–734
Article Google Scholar
Gygli M, Grabner H, Gool LV (2015) Video summarization by learning submodular mixtures of objectives Proceedings CVPR, pp 3090–3098
Google Scholar
Gygli M, Grabner H, Riemenschneider H et al (2014) Creating summaries from user videos Proceedings ECCV, pp 505–520
Google Scholar
Ioannidis A, Chasanis V, Likas A (2016) Weighted multi-view key-frame extraction[J]. Pattern Recogn Lett 72:52–61
Article Google Scholar
Kirkpatrick S, Gelatt CD, Vecchi MP (1983) Optimization by simulated annealing. Science, New Series 220(4598):671–680
MathSciNet MATH Google Scholar
Khosla A, Hamid R, Lin CJ et al (2014) Large-scale video summarization using web-image priors Proceedings CVPR, pp 2698–2705
Google Scholar
Kolmogorov V, Zabih R (2002) What energy functions can be minimized via graph cuts?[J]. IEEE Trans on PAMI 26(2):147–159
Article MATH Google Scholar
Li L, Zhou K, Xue GR et al (2011) Video summarization via transferrable structured learning International conference on world wide web, WWW 2011, hyderabad, India, March 28 - April, pp 287–296
Google Scholar
Lienhart R, Pfeiffer S, Effelsberg W (1997) Video abstracting. Commun ACM 40(12):54–62
Article Google Scholar
Liu YL, Xiao Y (2013) A robust image hashing algorithm resistant against geometrical attacks. Radio Eng 22(4):1072–1081
MathSciNet Google Scholar
Mei S, Guan G, Wang Z et al (2015) Video summarization via minimum sparse reconstruction. Pattern Recogn 48(2):522–533
Article Google Scholar
Mundur P, Rao Y, Yesha Y (2006) Keyframe-based video summarization using Delaunay clustering. Int J Digit Libr 6(2):219–232
Article Google Scholar
Ngo CW, Pong TC, Zhang HJ (2002) Motion-based video representation for scene change detection. Int J Comput Vis 50(2):127–142
Article MATH Google Scholar
Nie Y, Xiao C, Sun H et al (2012) Compact video synopsis via global spatiotemporal optimization. In IEEE Trans Vis Comput Graph 19(10):1664–1676
Article Google Scholar
Panagiotakis C, Doulamis A, Tziritas G (2009) Equivalent key frames selection based on iso-content principles. IEEE Trans Circuits Syst Video Technol 19(3):447–451
Article Google Scholar
Panagiotakis C, Ovsepian N, Michael E (2013) Video synopsis based on a sequential distortion minimization method International Conference on Computer Analysis of Images and Patterns Springer Berlin Heidelberg
Google Scholar
Potapov D, Douze M, Harchaoui Z et al (2014) Category-specific video summarization Proceedings ECCV, pp 540–555
Google Scholar
Pritch Y, Rav-Acha A, Peleg S (2008) Nonchronological video synopsis and indexing. IEEE Trans on PAMI 30(11):1971–1984
Article Google Scholar
The Shawshank Redemption (1994) Full Movie. https://www.youtube.com/watch?v=lrxgVXpsmzY (accessed 2016.02.22)
Vila M, Bardera A, Xu Q et al (2013) Tsallis entropy-based information measures for shot boundary detection and keyframe selection. SIViP 7(3):507–520
Article Google Scholar
Wang J, Bhat P, Colburn RA et al (2005) Interactive video cutout. ACM Transactions on Graphics (ToG). ACM 24(3):585–594
Google Scholar
Wang M, Hong R, Li G (2012) Event driven web video summarization by tag localization and key-shot identification. IEEE Trans Multimedia 14(4):975–985
Article Google Scholar
Wu J, Zhong SH, Jiang J et al (2016) A novel clustering method for static video summarization. Multimedia Tools and Applications :1–17
Yarmohammadi H, Rahmati M, Khadivi M (2013) Content based video retrieval using information theory Proceedings Iran Conferernce Machine Vis and Image Processing
Google Scholar
Zhang Q, Yu SP, Zhou DS et al (2011) An efficient method of key-frame extraction based on a cluster algorithm. Journal of Human Kinetics 39(1):5–13
Google Scholar
Zhao B, Xing P (2014) Quasi real-time summarization for consumer videos Proceedings CVPR, pp 2513–2520
Google Scholar
Zhao L, Qi W, Li SZ et al (2000) Key-frame extraction and shot retrieval using nearest feature line (NFL) Proceedings 2000 ACM workshops on Multimedia, pp 217–220
Chapter Google Scholar
Zhou X, Yang C, Yu W (2013) Moving object detection by detecting contiguous outliers in the low-rank representation. IEEE Trans on PAMI 35(3):597–610
Article Google Scholar
Zhu X, Loy CC, Gong S (2013) Video synopsis by heterogeneous multi-source correlation Proceedings ICCV, pp 81–88
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, 310027, China
Mengjuan Fei, Wei Jiang & Weijie Mao

Authors

Mengjuan Fei
View author publications
You can also search for this author in PubMed Google Scholar
Wei Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Weijie Mao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wei Jiang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fei, M., Jiang, W. & Mao, W. A novel compact yet rich key frame creation method for compressed video summarization. Multimed Tools Appl 77, 11957–11977 (2018). https://doi.org/10.1007/s11042-017-4843-2

Download citation

Received: 24 October 2016
Revised: 23 April 2017
Accepted: 17 May 2017
Published: 05 June 2017
Issue Date: May 2018
DOI: https://doi.org/10.1007/s11042-017-4843-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A novel compact yet rich key frame creation method for compressed video summarization

Abstract

Access this article

Similar content being viewed by others

Attention-based misaligned spatiotemporal auto-encoder for video anomaly detection

Video shot-boundary detection: issues, challenges and solutions

Video anomaly detection based on attention and efficient spatio-temporal feature extraction

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A novel compact yet rich key frame creation method for compressed video summarization

Abstract

Access this article

Similar content being viewed by others

Attention-based misaligned spatiotemporal auto-encoder for video anomaly detection

Video shot-boundary detection: issues, challenges and solutions

Video anomaly detection based on attention and efficient spatio-temporal feature extraction

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation