Detection and Recognition of Periodic, Nonrigid Motion

International Journal of Computer Vision

Abstract

The recognition of nonrigid motion, particularly that arising from human movement (and by extension from the locomotory activity of animals), has typically relied on high-level parametric models that represent the various body parts (legs, arms, trunk, head, etc.) and their connections to each other. Such model-based recognition has been successful in some cases; however, the methods are often difficult to apply to real-world scenes and are severely limited in their generalizability. The first problem arises from the difficulty of acquiring and tracking the requisite model parts, usually specific joints such as knees, elbows, or ankles. This generally requires some prior high-level understanding and segmentation of the scene, or initialization by a human operator. The second problem, generalization, arises because a model of the human body transfers poorly to dogs or birds, so a new model must be hand-crafted for each new type of motion. In this paper, we show that the recognition of human or animal locomotion, and in fact of any repetitive activity, can be done using low-level, non-parametric representations. Such an approach has the advantage that the same underlying representation is used for all examples, and no individual tailoring of models or prior scene understanding is required. We show in particular that repetitive motion is such a strong cue that the moving actor can be segmented, normalized spatially and temporally, and recognized by matching against a spatio-temporal template of motion features. We have implemented a real-time system that can recognize and classify repetitive motion activities in normal gray-scale image sequences. Results on a number of real-world sequences are described.
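
The pipeline outlined in the abstract (detect periodicity, normalize one cycle in time, match against a template) can be illustrated with a deliberately simplified sketch. This is not the authors' implementation: the abstract does not specify the motion features, the periodicity measure, or the matching rule. The sketch assumes a one-dimensional signal of per-frame motion magnitude, swaps in plain autocorrelation as the periodicity measure, and uses a nearest-template match over cyclic shifts. The function names (`estimate_period`, `resample_cycle`, `classify`) and the template labels are hypothetical.

```python
import math

def estimate_period(signal):
    """Estimate the period (in frames) of a 1-D signal via autocorrelation.

    Assumption: autocorrelation stands in for whatever periodicity
    measure the paper actually uses.
    """
    n = len(signal)
    mean = sum(signal) / n
    centered = [s - mean for s in signal]
    energy = sum(c * c for c in centered)
    if energy == 0:
        return None  # constant signal, no periodicity
    acf = [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / energy
        for lag in range(n // 2 + 1)
    ]
    # Skip the lobe around lag 0: advance until the correlation is no longer positive.
    lag = 1
    while lag < len(acf) and acf[lag] > 0:
        lag += 1
    if lag == len(acf):
        return None  # correlation never dropped, no clear cycle
    # The strongest remaining peak is taken as the period.
    return max(range(lag, len(acf)), key=lambda k: acf[k])

def resample_cycle(signal, start, period, length=8):
    """Temporal normalization: linearly resample one cycle to `length` samples."""
    out = []
    for k in range(length):
        pos = start + k * period / length
        lo = int(math.floor(pos))
        hi = min(lo + 1, len(signal) - 1)
        frac = pos - lo
        out.append((1 - frac) * signal[lo] + frac * signal[hi])
    return out

def classify(cycle, templates):
    """Return the label of the nearest template, trying every cyclic shift
    so that the unknown starting phase of the cycle does not matter."""
    best_label, best_dist = None, float("inf")
    for label, template in templates.items():
        for shift in range(len(cycle)):
            rotated = cycle[shift:] + cycle[:shift]
            dist = math.dist(rotated, template)
            if dist < best_dist:
                best_label, best_dist = label, dist
    return best_label

# Hypothetical templates: one normalized cycle per activity class.
TEMPLATES = {
    "walk": [math.sin(2 * math.pi * k / 8) for k in range(8)],
    "hop": [abs(math.sin(2 * math.pi * k / 8)) for k in range(8)],
}

if __name__ == "__main__":
    # A slower, phase-shifted "walk": 16 frames per cycle, 64 frames total.
    observed = [math.sin(2 * math.pi * (i + 3) / 16) for i in range(64)]
    period = estimate_period(observed)
    cycle = resample_cycle(observed, start=0, period=period)
    print(period, classify(cycle, TEMPLATES))
```

Because the cycle is resampled to a fixed length before matching, the same template serves for fast and slow instances of an activity. A real system would apply the same idea to a spatial grid of motion features per frame rather than to a single scalar.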




About this article

Cite this article

Polana, R., Nelson, R.C. Detection and Recognition of Periodic, Nonrigid Motion. International Journal of Computer Vision 23, 261–282 (1997). https://doi.org/10.1023/A:1007975200487


  • DOI: https://doi.org/10.1023/A:1007975200487
