Abstract
This paper proposes a complete scheme that uses a camera zooming descriptor with a two-level threshold to automatically retrieve close-ups directly from Moving Picture Experts Group (MPEG) compressed videos, based on camera motion analysis. A new algorithm for fast camera motion estimation in the compressed domain is presented, and the retrieval process is built on camera-motion-based semantic retrieval. To broaden the coverage of the proposed scheme, close-up retrieval is investigated across all kinds of videos. Extensive experiments show that the proposed scheme provides promising retrieval results in real-time, automatic application scenarios.
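The abstract does not give the estimation model or define the two-level threshold, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes each macroblock motion vector (u, v) at position (x, y) follows a simple zoom-plus-pan model, u = z*x + px and v = z*y + py, fits the zoom rate z by least squares, and interprets the two-level threshold as hysteresis (a high threshold to open a zoom-in segment, a low one to keep it open). The function names, the model, the hysteresis interpretation, and the sign convention (positive z treated as zoom-in) are all hypothetical.

```python
from typing import List, Sequence, Tuple

# One motion-vector sample: block position (x, y) and displacement (u, v).
Vector = Tuple[float, float, float, float]


def estimate_zoom(field: Sequence[Vector]) -> float:
    """Least-squares zoom rate z for the assumed model u = z*x + px, v = z*y + py."""
    n = len(field)
    if n == 0:
        return 0.0
    # Subtracting the means removes the pan terms (px, py) from the fit.
    mx = sum(f[0] for f in field) / n
    my = sum(f[1] for f in field) / n
    mu = sum(f[2] for f in field) / n
    mv = sum(f[3] for f in field) / n
    num = sum((x - mx) * (u - mu) + (y - my) * (v - mv) for x, y, u, v in field)
    den = sum((x - mx) ** 2 + (y - my) ** 2 for x, y, _, _ in field)
    return num / den if den > 0 else 0.0


def zoom_in_segments(
    zooms: Sequence[float], high: float, low: float
) -> List[Tuple[int, int]]:
    """Assumed two-level (hysteresis) rule over per-frame zoom rates.

    A segment opens when z >= high and stays open while z >= low.
    Returns inclusive (first_frame, last_frame) index pairs.
    """
    segments: List[Tuple[int, int]] = []
    start = None
    for i, z in enumerate(zooms):
        if start is None:
            if z >= high:
                start = i
        elif z < low:
            segments.append((start, i - 1))
            start = None
    if start is not None:
        segments.append((start, len(zooms) - 1))
    return segments


if __name__ == "__main__":
    # Synthetic 3x3 field with zoom rate 0.05 and a constant pan of (2, -1).
    grid = [(x, y, 0.05 * x + 2.0, 0.05 * y - 1.0)
            for x in (-1.0, 0.0, 1.0) for y in (-1.0, 0.0, 1.0)]
    print(estimate_zoom(grid))
    print(zoom_in_segments([0.0, 0.03, 0.06, 0.04, 0.01, 0.07], high=0.05, low=0.02))
```

In this sketch the frames inside each returned segment would be the close-up candidates. In a real MPEG stream the sign of z depends on the prediction direction of the motion vectors, and intra-coded blocks carry no vectors at all, so both would need handling before a fit like this is usable.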
Additional information
This work was supported by the European IST FP6 Research Programme under the Integrated Project LIVE (No. IST-4-027312).
Ying Weng received the B. Sc. degree from Yunnan University, China, in 1999, and the Ph.D. degree from Chinese Academy of Sciences, Beijing, in 2005. She is currently a postdoctoral student at the University of Bradford, UK. She has published 18 research papers. She is a member of IEEE.
Her research interests include image/video processing, visual information retrieval, Internet video coding, wireless communications, and multimedia transmission.
Jianmin Jiang received the B. Sc. degree from Shandong Mining Institute, China, in 1982, the M. Sc. degree from China University of Mining and Technology in 1984, and the Ph.D. degree from the University of Nottingham, UK, in 1994. From 1985 to 1989, he was a lecturer at Jiangxi University of Technology, China. In 1989, he joined Loughborough University, UK, as a visiting scholar and later moved to the University of Nottingham as a research assistant. In 1992, he was appointed a lecturer of electronics at Bolton Institute, UK, and moved back to Loughborough University in 1995 as a lecturer of computer science. In 1997, he was appointed a full professor at the School of Computing, University of Glamorgan, Pontypridd, UK. In 2002, he joined the School of Informatics, University of Bradford, UK, as a professor of digital media. He has published more than 200 refereed research papers.
His research interests include visual information retrieval, image/video processing, visual content management, Internet video coding, stereo image coding, and neural network applications.
About this article
Cite this article
Weng, Y., Jiang, J. Real-time and automatic close-up retrieval from compressed videos. Int. J. Autom. Comput. 5, 198–201 (2008). https://doi.org/10.1007/s11633-008-0198-5
DOI: https://doi.org/10.1007/s11633-008-0198-5