Skip to main content

Recognition of Manual Actions Using Vector Quantization and Dynamic Time Warping

  • Conference paper
Hybrid Artificial Intelligence Systems (HAIS 2010)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6076))

Included in the following conference series:

Abstract

The recognition of manual actions, i.e., hand movements, hand postures and gestures, plays an important role in human-computer interaction, while belonging to a category of particularly difficult tasks. Using a Vicon system to capture 3D spatial data, we investigate the recognition of manual actions in tasks such as pouring a cup of milk and writing into a book. We propose recognizing sequences in multidimensional time-series by first learning a smooth quantization of the data, and then using a variant of dynamic time warping to recognize short sequences of prototypical motions in a long unknown sequence. An experimental analysis validates our approach. Short manual actions are successfully recognized and the approach is shown to be spatially invariant. We also show that the approach speeds up processing while not decreasing recognition performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Vicon motion capture system, http://www.vicon.com/

  2. Caridakis, G., Karpouzis, K., Drosopoulos, A., Kollias, S.: SOMM: Self organizing Markov map for gesture recognition. Pattern Recogn. Lett. 31(1), 52–59 (2009)

    Article  Google Scholar 

  3. Chang, L.Y., Pollard, N.S., Mitchell, T.M., Xing, E.P.: Feature selection for grasp recognition from optical markers. In: Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 2944–2950 (2007)

    Google Scholar 

  4. Ekvall, S., Kragic, D.: Grasp recognition for programming by demonstration. In: Proc. IEEE Int. Conf. Robotics and Automation, pp. 748–753 (2005)

    Google Scholar 

  5. Gavrila, D.M., Davis, L.S.: 3-D model-based tracking and recognition of human movement. In: Proc. Int. Work. on Face and Gesture Recognition, Zurich, Switzerland (1995)

    Google Scholar 

  6. Jones, E., Oliphant, T., Peterson, P., et al.: SciPy: Open Source scientific tools for Python (2001-2010)

    Google Scholar 

  7. Keogh, E.J., Chu, S., Hart, D., Pazzani, M.J.: An online algorithm for segmenting time series. In: Proc. of the 2001 IEEE Int. Conf. on Data Mining, pp. 289–296 (2001)

    Google Scholar 

  8. Kohonen, T.: The self-organizing map. Proc. IEEE 78(9), 1464–1480 (1990)

    Article  Google Scholar 

  9. Lee, H.-K., Kim, J.H.: An HMM-based threshold model approach for gesture recognition. IEEE Trans. Pattern Anal. Mach. Intell. 21, 961–973 (1999)

    Article  Google Scholar 

  10. Lloyd, S.P.: Least squares quantization in PCM. IEEE Trans. Inform. Theor. 28(2), 129–137 (1982)

    Article  MathSciNet  MATH  Google Scholar 

  11. Martinetz, T., Schulten, K.: A neural-gas network learns topologies. Artificial Neural Networks, 397–402 (1991)

    Google Scholar 

  12. Ritter, H., Robert, H., Röthling, F., Steil, J.J.: Manual intelligence as a Rosetta Stone for robot cognition. In: ISRR (December 2007)

    Google Scholar 

  13. Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 26, 43–49 (1978)

    Article  MATH  Google Scholar 

  14. Stiefmeier, T., Roggen, D.: Gestures are strings: Efficient online gesture spotting and classification using string matching. In: Proc. of 2nd Int. Conf. on Body Area Networks, BodyNets (2007)

    Google Scholar 

  15. Wendemuth, A.: Grundlagen der stochastischen Sprachverarbeitung, Oldenbourg (2004)

    Google Scholar 

  16. Zito, T., Wilbert, N., Wiskott, L., Berkes, P.: Modular toolkit for Data Processing (MDP): a Python data processing frame work. Frontiers in Neuroinformatics 2, 8 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Martin, M., Maycock, J., Schmidt, F.P., Kramer, O. (2010). Recognition of Manual Actions Using Vector Quantization and Dynamic Time Warping. In: Graña Romay, M., Corchado, E., Garcia Sebastian, M.T. (eds) Hybrid Artificial Intelligence Systems. HAIS 2010. Lecture Notes in Computer Science(), vol 6076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13769-3_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-13769-3_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-13768-6

  • Online ISBN: 978-3-642-13769-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics