Abstract
We present an original approach to motion-based video retrieval involving partial queries. More precisely, we propose a unified statistical framework that simultaneously extracts entities of interest in video shots and supplies the associated content-based characterization, which can then be used to satisfy partial queries. It relies on the analysis of motion activity in video sequences, based on a non-parametric probabilistic modeling of motion information. Areas comprising relevant types of motion activity are extracted by a Markovian region-level labeling applied to the adjacency graph of an initial block-based partition of the image. As a consequence, given a set of videos, we are able to construct a structured base of samples of entities of interest, each represented by its associated statistical model of motion activity. The retrieval operation is then formulated as a Bayesian inference problem using the MAP criterion. We report results of the extraction of entities of interest in video sequences, together with examples of retrieval operations performed on a base composed of one hundred video samples.
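As a rough illustration of the retrieval step only, the sketch below ranks the entities of a small base by their posterior score for a query and returns the MAP candidate first. It is a simplified stand-in, not the model of the paper: each entity is assumed to be summarized by a smoothed histogram of quantized motion values, observations are assumed independent, and the names `fit_model`, `log_posterior`, and `map_retrieve`, the smoothing constant, and the toy data are all hypothetical.

```python
import math


def fit_model(samples, n_bins, eps=1e-6):
    """Assumed non-parametric model: a smoothed, normalized histogram
    of quantized motion values (integers in [0, n_bins))."""
    counts = [eps] * n_bins
    for s in samples:
        counts[s] += 1.0
    total = sum(counts)
    return [c / total for c in counts]


def log_posterior(query, model, prior):
    """log P(query | model) + log P(model), up to a constant,
    under the (simplifying) assumption of independent observations."""
    return sum(math.log(model[q]) for q in query) + math.log(prior)


def map_retrieve(query, base, priors=None):
    """Rank the entities of the base by decreasing posterior score.
    The first element is the MAP answer to the query."""
    if priors is None:
        # Uniform prior over the entities of the base.
        priors = {name: 1.0 / len(base) for name in base}
    scores = {
        name: log_posterior(query, model, priors[name])
        for name, model in base.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy base of two entities with made-up quantized motion observations.
base = {
    "low_activity": fit_model([0, 0, 0, 1, 0, 1, 0, 0], n_bins=4),
    "high_activity": fit_model([3, 2, 3, 3, 2, 3, 1, 3], n_bins=4),
}

# Partial query: motion values observed in a user-selected area.
ranking = map_retrieve([3, 3, 2, 3], base)
```

With a uniform prior the ranking reduces to a maximum-likelihood ordering; a non-uniform prior would let the base favor some classes of entities over others.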
Cite this article
Fablet, R., Bouthemy, P. Non-Parametric Motion Activity Analysis for Statistical Retrieval with Partial Query. Journal of Mathematical Imaging and Vision 14, 257–270 (2001). https://doi.org/10.1023/A:1011238113358
DOI: https://doi.org/10.1023/A:1011238113358