Abstract
This paper describes a new multi‐attribute algorithm for detecting cuts, thereby segmenting a video into shots. Our Web‐based video library contains a large volume of news and documentary material, and most transitions between shots in that type of programming are cuts rather than dissolves or other complex transitions. The algorithm first uses a motion metric to identify a set of candidate cuts, then uses luminance histograms to eliminate false cuts. Our experimental results show that this algorithm is more accurate than previous motion‐based transition detection algorithms.
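The two-stage idea in the abstract (a motion metric proposes cuts, luminance histograms reject the false ones) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the motion metric, so the mean absolute luminance difference between consecutive frames is used here as a crude stand-in, and the function names, bin count, and both thresholds are hypothetical values chosen only for the example.

```python
from typing import List, Sequence

# A frame is modeled as rows of 8-bit luminance values (assumption for this sketch).
Frame = Sequence[Sequence[int]]


def motion_score(prev: Frame, curr: Frame) -> float:
    """Stand-in motion metric: mean absolute luminance difference per pixel."""
    total = 0
    count = 0
    for row_prev, row_curr in zip(prev, curr):
        for p, c in zip(row_prev, row_curr):
            total += abs(p - c)
            count += 1
    return total / count if count else 0.0


def luminance_histogram(frame: Frame, bins: int = 16) -> List[float]:
    """Normalized histogram of luminance values in the range 0..255."""
    hist = [0] * bins
    count = 0
    for row in frame:
        for value in row:
            hist[min(value * bins // 256, bins - 1)] += 1
            count += 1
    return [h / count for h in hist] if count else [0.0] * bins


def histogram_distance(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Half the L1 distance: 0.0 for identical histograms, 1.0 for disjoint ones."""
    return 0.5 * sum(abs(a - b) for a, b in zip(h1, h2))


def detect_cuts(
    frames: Sequence[Frame],
    motion_threshold: float = 30.0,  # hypothetical value
    hist_threshold: float = 0.4,     # hypothetical value
) -> List[int]:
    """Return indices i where a cut is declared between frames i-1 and i."""
    cuts = []
    for i in range(1, len(frames)):
        # Stage 1: the motion metric proposes a candidate cut.
        if motion_score(frames[i - 1], frames[i]) < motion_threshold:
            continue
        # Stage 2: the luminance histograms must also differ, otherwise the
        # candidate is treated as a false cut (e.g. strong motion within a shot).
        distance = histogram_distance(
            luminance_histogram(frames[i - 1]), luminance_histogram(frames[i])
        )
        if distance >= hist_threshold:
            cuts.append(i)
    return cuts


# Usage: an inverted checkerboard changes every pixel but not the histogram,
# so stage 2 rejects it; the switch to a uniform gray frame is kept as a cut.
board = [[255 if (r + c) % 2 == 0 else 0 for c in range(4)] for r in range(4)]
inverted = [[255 - v for v in row] for row in board]
gray = [[100] * 4 for _ in range(4)]
print(detect_cuts([board, board, inverted, gray, gray]))  # prints [3]
```

The second stage only ever removes candidates, which matches the abstract's description of histograms being used to eliminate false cuts rather than to find new ones.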
Cite this article
Philips, M., Wolf, W. A multi‐attribute shot segmentation algorithm for video programs. Telecommunication Systems 9, 393–402 (1998). https://doi.org/10.1023/A:1019164327291
DOI: https://doi.org/10.1023/A:1019164327291