Abstract
Sports video appeals to large audiences due to its high commercial potentials. Automatically extracting useful semantic information and generating highlight summary from sports video to facilitate users’ accessing requirements is an important problem, especially in the forthcoming broadband mobile communication and the need for users to access their multimedia information of interest from anywhere at anytime with their most convenient digital equipments. In this paper, a system to generate highlight summaries oriented for mobile applications is introduced, which includes highlight extraction and video adaptation. In this system, several highlight extraction techniques are provided for field sports video and racket sports video by using multi-modal information. To enhance users’ viewing experience and save bandwidth, 3D animation from highlight segment is also generated. As an important procedure to make video analysis results universally applicable, video transcoding techniques are applied to adapt the video for mobile communication environment and user preference. Experimental results are encouraging and show the advantage and feasibility of the system for multimedia content personalization, enhancement and adaptation to meet different user preference and network/device requirements.
Similar content being viewed by others
References
Assunçno, P., Ghanbari, M., 1996. Post-processing of MPEG-2 Coded Video for Transmission at Lower Bit-rates. Proc. IEEE Int. Conf. Acoustics, Speech and Signal Processing. Atlanta, GA, p.1998–2001.
Bebie, T., Bieri, H., 1998. SoccerMan-reconstructing Soccer Games from Video Sequences. Proc. of ICIP, p.898–902.
Bertini, M., Cucchiara, R., Bimbo, A.D., Prati, A., 2004. Content-based Video Adaptation with User’s Preference. IEEE International Conference on Multimedia and Expo (ICME).
Bjork, N., Christopoulos, C., 1998. Transcoder architectures for video coding. IEEE Trans. Consumer Electron., 44(1):88–98. [doi:10.1109/30.663734]
Cauwenberghs, G., Poggio, T., 2001. Incremental and Decremental Support Vector Machine Learning, Advances in Neural Information Processing Systems. MIT Press, Cambridge, MA, 13:409–415.
Chang, S.F., 2003. Content-Based Video Summarization and Adaptation for Ubiquitous Media Access. Proceedings of the 12th International Conference on Image Analysis and Processing (ICIAP’03).
Ekin, A., Tekalp, A.M., Mehrotra, R., 2003. Automatic soccer video analysis and summarization. IEEE Trans. Image Processing, 12(7):796–807. [doi:10.1109/TIP.2003.812758]
Gong, Y., Lim, T.S., Chua, H.C., 1995. Automatic Parsing of TV Soccer Programs. Proc. IEEE Int. Conf. on Multimedia Computing and Systems.
Hu, B., Zhang, P., Huang, Q., Gao, W., 2005. Reducing Spatial Resolution for MPEG-2 to H.264/AVC Transcoding. Proc. on Pacific-Rim Conference on Multimedia, II:830–840.
Jain, A.K., 2001. Statistical pattern recognition: a review. IEEE Trans. on PAMI, 2:4–37.
Kijak, E., Gravier, G., Gros, P., Oisel, L., Bimbot, F., 2003. HMM Based Structuring of Tennis Videos Using Visual and Audio Cues. Proc. IEEE Int. Conf. Multimedia and Expo.
Leonardi, R., Migliorati, P., Prandini, M., 2004. Semantic indexing of soccer audio-visual sequences: a multimodal approach based on controlled Markov chains. IEEE Trans. Circuits Syst. Video Techn., 14(5):634–643. [doi:10.1109/TCSVT.2004.826751]
Liu, Y., Jiang, S.Q.; Ye, Q.X., Gao, W., Huang, Q.M., 2005a. Playfield Detection Using Adaptive GMM and Its Application. ICASSP2005. Philadelphia, PA, USA.
Liu, Y., Huang, Q., Ye, Q., Gao, W., 2005b. A New Method to Calculate the Camera Focusing Area and Player Position on Playfield in Soccer Video. Proc. of VCIP.
Matsui, K., Iwase, M., Agata, M., Tanaka, T., Ohnishi, N., 1998. Soccer Image Sequence Computed by a Virtual Camera. Proc. of CVPR, p.860–865.
Rui, Y., Gupta, A., Acero, A., 2000. Automatically Extracting Highlights for TV Baseball Programs. Proc. the Eighth ACM Int. Conf. Multimedia, p.105–115.
Shanableh, T., Ghanbari, M., 2000. Heterogeneous video transcoding to lower spatio-temporal resolutions and different encoding formats. IEEE Trans. Multimedia, 2(2):101–110. [doi:10.1109/6046.845014]
Snoek, C.G.M., Worring, M., 2005. Multimodal video indexing: a review of the state-of-the-art. Multimedia Tools and Applications, 25(7):767–775.
Sun, H., Kwok, W., Zdepski, J., 1996. Architectures for MPEG compressed bitstream scaling. IEEE Trans. Circuits Syst. Video Technol., 6(2):191–199. [doi:10.1109/76.488826]
Vetro, A., Haga, T., Sumi, K., Sun, H.F., 2003. Object-Based Coding for Long-Term Archive of Surveillance Video. Proceedings of International Conference on Multimedia and Expo.
Xie, L., Chang, S.F., Divakaran, A., Sun, H., 2003. Structure analysis of soccer video with domain knowledge and hidden Markov models. Pattern Recognition Letters, 24(15):767–775.
Yu, X., Yan, X., Hay, T.S., Leong, H.W., 2004. 3D Reconstruction and Enrichment of Broadcast Soccer Video. Proc. of ACM Multimedia.
Author information
Authors and Affiliations
Additional information
Project supported by NEC Research of China (No. 0P2004001), “Science 100 Plan” of the Chinese Academy of Sciences (No. m2041), and the Natural Science Foundation (No. 4063041) of Beijing, China
Rights and permissions
About this article
Cite this article
Gao, W., Huang, Qm., Jiang, Sq. et al. Sports video summarization and adaptation for application in mobile communication. J. Zhejiang Univ. - Sci. A 7, 819–829 (2006). https://doi.org/10.1631/jzus.2006.A0819
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1631/jzus.2006.A0819