Skip to main content
Log in

Real-time and automatic close-up retrieval from compressed videos

  • Published:
International Journal of Automation and Computing Aims and scope Submit manuscript

Abstract

This paper proposes a thorough scheme, by virtue of camera zooming descriptor with two-level threshold, to automatically retrieve close-ups directly from moving picture experts group (MPEG) compressed videos based on camera motion analysis. A new algorithm for fast camera motion estimation in compressed domain is presented. In the retrieval process, camera-motion-based semantic retrieval is built. To improve the coverage of the proposed scheme, close-up retrieval in all kinds of videos is investigated. Extensive experiments illustrate that the proposed scheme provides promising retrieval results under real-time and automatic application scenario.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. J. Jiang, Y. Weng, P. J. Li. Dominant Colour Extraction in DCT Domain. Image and Vision Computing Journal, vol. 24, no. 12, pp. 1269–1277, 2006.

    Article  Google Scholar 

  2. J. Jiang, Y. Weng. Video Extraction for Fast Content Access to MPEG Compressed Videos. IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 5, pp. 595–605, 2004.

    Article  Google Scholar 

  3. J. Vendrig, M. Worring. Systematic Evaluation of Logical Story Unit Segmentation. IEEE Transactions on Multimedia, vol. 4, no. 4, pp. 492–499, 2002.

    Article  Google Scholar 

  4. H. W. Agius, M. C. Angelides. Modeling Content for Semantic-level Querying of Multimedia. Multimedia Tools and Applications, vol. 15, no. 1, pp. 5–37, 2001.

    Article  MATH  Google Scholar 

  5. T. Athanasiadis, P. Mylonas, Y. Avrithis, S. Kollias. Semantic Image Segmentation and Object Labeling. IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 3, pp. 298–312, 2007.

    Article  Google Scholar 

  6. D. Djordjevic, E. Izquierdo. An Object-and User-driven System for Semantic-based Image Annotation and Retrieval. IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 3, pp. 313–323, 2007.

    Article  Google Scholar 

  7. D. Vallet, P. Castells, M. Fernandez, P. Mylonas, Y. Avrithis. Personalized Content Retrieval in Context Using Ontological Knowledge. IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 3, pp. 336–346, 2007.

    Article  Google Scholar 

  8. J. W. Hsieh, S. L. Yu, Y. S. Chen. Motion-based Video Retrieval by Trajectory Matching. IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 3, pp. 396–409, 2006.

    Article  Google Scholar 

  9. K. W. Sze, K. M. Lam, G. Qiu. A New Key Frame Representation for Video Segment Retrieval. IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 9, pp. 1148–1155, 2005.

    Article  Google Scholar 

  10. F. Jing, M. Li, H. J. Zhang, B. Zhang. Relevance Feedback in Region-based Image Retrieval. IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 5, pp. 672–681, 2004.

    Article  Google Scholar 

  11. S. Antani, R. Kasturi, R. Jain. A Survey on the Use of Pattern Recognition Methods for Abstraction, Indexing and Retrieval of Images and Video. Pattern Recognition, vol. 35, no. 4, pp. 945–965, 2002.

    Article  MATH  Google Scholar 

  12. Y. H. Ho, C.W. Lin, J. F. Chen, H. Y. M. Liao. Fast Coarse-to-fine Video Retrieval Using Shot-level Spatio-temporal Statistics. IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 5, pp. 642–648, 2006.

    Article  Google Scholar 

  13. D. Farin, P. H. N. De. With. Enabling Arbitrary Rotational Camera Motion Using Multisprites with Minimum Coding Cost. IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 4, pp. 492–506, 2006.

    Article  Google Scholar 

  14. Y. Su, M. T. Sun, V. Hsu. Global Motion Estimation from Coarsely Sampled Motion Vector Field and the Applications. IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 2, pp. 232–242, 2005.

    Article  Google Scholar 

  15. J. C. Huang, W. S. Hsieh. Automatic Feature-based Global Motion Estimation in Video Sequences. IEEE Transactions on Consumer Electronics, vol. 50, no. 3, pp. 911–915, 2004.

    Article  Google Scholar 

  16. Y. P. Tan, D. D. Saur, S. R. Kulkarni, P. J. Ramadge. Rapid Estimation of Camera Motion from Compressed Video with Application to Video Annotation. IEEE Transactions on Circuits and Systems for Video Technology, vol. 10, no. 1, pp. 133–146, 2000.

    Article  Google Scholar 

  17. L. Liu, X. Ye, M. Yao, S. Zhang. A Semantic Description Scheme of Soccer Video Based on MPEG-7. In Proceedings of the 5th Pacific Rim Conference on Multimedia, Lecture Notes in Computer Science, Springer-Verlag, Tokyo, Japan, vol. 3332, pp. 298–305, 2004.

    Google Scholar 

  18. D. W. Tjondronegoro, Y. P. Chen, B. Pham. Classification of Self-consumable Highlights for Soccer Video Summaries. In Proceedings of the 5th IEEE International Conference on Multimedia and Expo, IEEE Press, Taipei, PRC, vol. 1, pp. 579–582, 2004.

    Google Scholar 

  19. G. Jin, L. Tao, G. Xu. Hidden Markov Model Based Events Detection in Soccer Video. In Proceedings of International Conference of Image Analysis and Recognition, Lecture Notes in Computer Science, Springer-Verlag, Porto, Portugal, vol. 3211, pp. 605–612, 2004.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ying Weng.

Additional information

This work was supported by European IST FP6 Research Programme as funded for the Integrated Project: LIVE (No. IST-4-027312).

Ying Weng received the B. Sc. degree from Yunnan University, China, in 1999, and the Ph.D. degree from Chinese Academy of Sciences, Beijing, in 2005. She is currently a postdoctoral student at the University of Bradford, UK. She has published 18 research papers. She is a member of IEEE.

Her research interests include image/video processing, visual information retrieval, Internet video coding, wireless communications, and multimedia transmission.

Jianmin Jiang received the B. Sc. degree from Shandong Mining Institute, China, in 1982, the M. Sc. degree from China University of Mining and Technology in 1984, and the Ph.D. degree from the University of Nottingham, UK, in 1994. From 1985 to 1989, he was a lecturer at Jiangxi University of Technology, China. In 1989, he joined Loughborough University, UK, as a visiting scholar and later moved to the University of Nottingham as a research assistant. In 1992, he was appointed a lecturer of electronics at Bolton Institute, UK, and moved back to Loughborough University in 1995 as a lecturer of computer science. In 1997, he was appointed as a full professor at the School of Computing, University of Glamorgan, Pontypridd, UK. He joined University of Bradford in 2002 as a professor of digital media at the School of Informatics, University of Bradford, UK. He has published more than 200 refereed research papers.

His research interests include visual information retrieval, image/video processing, visual content management, Internet video coding, stereo image coding, and neural network applications.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Weng, Y., Jiang, J. Real-time and automatic close-up retrieval from compressed videos. Int. J. Autom. Comput. 5, 198–201 (2008). https://doi.org/10.1007/s11633-008-0198-5

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11633-008-0198-5

Keywords

Navigation